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ABSTRACT 


Various computational models are used around the world to predict the 
number of infected individuals and the death rate of the COVID-19 outbreak 
[3]. Machine learning-based models are important to take proper actions. Due 
to the ample of uncertainty and crucial data, the aerodynamic models have 
been challenged regarding higher accuracy for long-term prediction of this 
disease [1]. By researching the COVID19 problem, it is observed that lockdown 
and isolation are important techniques for preventing the spread of COVID-19 
[2]. In India, public health and the economical condition are impacted by 
COVID-19, our goal is to visualize the spread of this disease [5]. Machine 
Learning Algorithms are used in various applications for detecting adverse 
risk factors. Three ML algorithms we are using that is Logistic Regression (LR), 
Support Vector Machine (SVM), and Random Forest Classifier (RFC). These 
machine learning models are predicting the total number of recovered 


patients as per the date of each state in India [8]. 
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INTRODUCTION: 

Various drastic respiratory syndrome coronavirus 2, also 
called SARS-CoV-2, is reported as a strain of virus inducing 
the respiratory disease of COVID-19 The coronavirus disease 
is confirmed to be extremely contagious by the global 
nations and the World Health Organization (WHO). The 
COVID-19 pandemic has become an international concern 
that has recognized as a public health emergency all over the 
world. Epidemiological models have been broadly used by 
media and officials to assess the outbreak, recognize the 
peak beforehand, and also anticipate the mortality rate. 
Outburst prediction models have manifested to be elemental 
to furnish observations into the harm brought by COVID-19 
[3]. Moreover, the prediction models are taken as a Scientific 
report to construct new plans and to assess the conditions of 
curfew. The COVID-19 pandemic has been broadcasted to be 
exceedingly aggressive to spread [1]. Towards the end of 
2019, the new coronavirus spread widely in China, and a 
huge mass get infected. At present, the household upsurge 
has been productively managed, although the new 
coronavirus is escalating speedily in other areas. Presently, 
Europe has become the base of the ongoing outburst of new 
pneumonia. On March 11, the World Health Organization 
declared new pneumonia outburst a “global pandemic” [5]. 
This disease has provoked huge impact to the health and 
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economical condition of the people. The research on the 
household and international endemic and the upcoming 
evolution shift has been subject undergoing intense study 
[6]. While no model can precisely foresee the rates of 
infection and mortality, there have been efforts made to 
contemplate and inspect the strengths and drawbacks of 
severe studies and models conferred regarding the 
coronavirus. 


Background: 

We investigate the matter of modeling the latest coronavirus 
(Covid19) spread the constrained or practical scenarios in 
India. It is monitored that lock-down and isolation are the 
crucial steps to control the spread of this disease. We study 
the result of those prevention techniques mathematically 
and build a model to predict the new cases or total infected 
cases. This prediction is required to organize reports and to 
take further steps to stop preventing this disease. [2]. 
COVID-19 has been proved a global pandemic and a couple 
of tests are using various ML based models to manage the 
reports of the various cases at global level. Here, we 
showcase our model that would be able to predict the arrival 
of new cases across India. Various classification methods are 
used for making an crucial report on COVID-19 across India 


[4]. 
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Machine learning-based models have showcase the 
importance to predict the perioperative output to enhance 
the decision making on the longer-term period. Various 
prediction techniques are used to handle speculate 
problems. The study of COVID-19 shows the potential of ML 
based models to predict the number of upcoming patients 
suffering from COVID-19 [1]. 


Computational techniques for machine learning-based 
models have shown their importance for monitoring the 
potential threats for valueable decisions. Machine learning 
algorithms utilized in various applications show the 
inauspicious risk factors. The study shows the ability of 
model to estimate the number of individuals who are 
affected by this disease. This machine learning based model 
has different kinds of outcomes like the number of newly 


COVU-TY 
DATASET 


infected cases, death rates, and the recovered cases 
estimates in the next 10 days across India [8]. 


Proposed Methodology: 

We extracted Indian country data about the COVID-19 
disease from the Kaggle website Data was extracted on 4 
September 2020. Using Jupyter Notebook and Python 
libraries we merged and filtered the final analysis of the 
dataset. In this, we use Pandas, NumPy, matplpotlib, seaborn 
and, sklearn library. From the list of unique states, we get the 
names of the top five states having maximum confirmed 
cases, totally cured cases, and total new cases. Then we plot 
the facet grid for total confirmed cases concerning the state. 
We plot one feature map which helps us to do the feature 
selection. And based on feature selection, we divide the 
dataset into training and test data. 
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Here are some visualizations for total confirmed cases, cured cases, new cases, and facetgrid for each state (Confirmed Case) - 


State 
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We use three machine learning algorithms to build the 
model: 


Logistic Regression: It is an classification based algorithm 
which is used to predict the unique values. It predict the 
output in the form of binary variable Basically it predicts 
whether the particular event is going to happen or not. Its 
output falls in then range between 0 and 1 [6]. 


Support Vector Machine: It is an classification based 
method in which we plot item as an point in N-dimensional 
world (N denotes total number of features the dataset 
contains). Each point has two co-ordinates that is X and Y 
and this co-ordinates are called as support vectors [8]. 


Random Forest Classifier: The random forest classifier is a 
powerful algorithm for disease prediction and it is a 
collection of decision trees. It takes the votes from all the 
decision trees depending on the parameters we give and it 
combines all the votes of decision trees to produce a single 
output. These decision trees contains two different stages 
that is ensemble learning and feature engineering. First, it 
maps from the training data that is uniquely selected, which 
ensures that each tree predicts some values using machine 
learing protocols. After this, it collects all the predicted 
values to produce the best unique output on the basis of 
votes [7]. 


Result Analysis: In this study, we build a computational 
model that will predict the total number of recovered 
patients that are suffering from COVID-19. The dataset we 
used contains the record about total confirmed cases, total 
deaths, cured patients, new cases, and total recovered 
patients per state wise. Three machine learning algorithms 
are used to predict the number of recovered patients. 


S.NO Algorithm Accuracy 


Logistic Regression 36.74% 


Support Vector Machine | 36.74% 
Random Forest Classifier | 36.10% 





Conclusion: We have looked into the issue of Covid19 
spread in India in a big picture. A mathematical model has 
been accepted, which accompanies the actual data course of 
Covid19 spread in India. As we do lots of data visualization 
to see in the particular region how many cases are arising 
day by day and after that making use of the machine learning 
algorithms we will train our model in such a way that it will 
predict up to maximum accuracy it can do. 
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